Passage-level Evidence for Cross-Language Information Retrieval
نویسندگان
چکیده
Machine translation (MT) techniques can be used to generate a query in a target language from a query in a source language for the cross-language information retrieval (CLIR). Recent MT systems have advanced enough to generate translations which are human-readable, However, translation error is still a serious impediment which hurts the effectiveness of a CLIR system. To compensate for defects in a machinetranslation result, we propose a method using passage-level evidence. By combining a document retrieval model with a passage retrieval model, we prevent the retrieval model from assigning a high score to a non-relevant document because of translation error. The retrieval model incorporating passage retrieval shows better results than a document retrieval model. In particular, the passage retrieval model achieves more improvement when the translation quality of queries is relatively low.
منابع مشابه
Two Approaches for Multilingual Question Answering: Merging Passages vs. Merging Answers
One major problem in multilingual Question Answering (QA) is the integration of information obtained from different languages into one single ranked list. This paper proposes two different architectures to overcome this problem. The first one performs the information merging at passage level, whereas the second does it at answer level. In both cases, we applied a set of traditional merging stra...
متن کاملEnhancing Relevance Models with Adaptive Passage Retrieval
Passage retrieval and pseudo relevance feedback/query expansion have been reported as two effective means for improving document retrieval in literature. Relevance models, while improving retrieval in most cases, hurts performance on some heterogeneous collections. Previous research has shown that combining passage-level evidence with pseudo relevance feedback brings added benefits. In this pap...
متن کاملBoosting Passage Retrieval through Reuse in Question Answering
Question Answering (QA) is an emerging important field in Information Retrieval. In a QA system the archive of previous questions asked from the system makes a collection full of useful factual nuggets. This paper makes an initial attempt to investigate the reuse of facts contained in the archive of previous questions to help and gain performance in answering future related factoid questions. I...
متن کاملUtilizing Passage-Based Language Models for Document Retrieval
We show that several previously proposed passage-based document ranking principles, along with some new ones, can be derived from the same probabilistic model. We use language models to instantiate specific algorithms, and propose a passage language model that integrates information from the ambient document to an extent controlled by the estimated document homogeneity. Several document-homogen...
متن کاملImpact of Controlled and Free Language Use in Retrieving Articles from the ProQuest and Science Direct Databases
Abstract Introduction: The growth and expansion of the Internet has changed the way information is accessed and many facilities have been created on the Web to facilitate and expedite information locating. Objective: To identify the impact of keyword documentation using the medical thesaurus on the retrieval of articles from Proquest and Science Direct databases. Materials and Methods:The pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012